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We develop a scheme for fault-tolerant quantum computation based on asymmetric Bacon-Shor 
codes, which works effectively against highly biased noise dominated by dephasing. We find the 
optimal Bacon-Shor block size as a function of the noise strength and the noise bias, and estimate 
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based on gate teleportation, are well suited for hardware platforms with geometrically local gates 
in two dimensions. 
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I. INTRODUCTION 

The theory of fault-tolerant quantum com- 
putation [TJH] nas established that noisy quan- 
tum computers can operate reliably provided 
the noise is neither too strong nor too strongly 
correlated. In a fault-tolerant quantum cir- 
cuit, carefully designed gadgets process logical 
qubits protected by quantum error-correcting 
codes. 

Typical fault-tolerant gadgets are designed 
to work effectively against generic noise with- 
out any special structure. But in some physi- 
cal settings, the noise is expected to be highly 
biased, with dephasing in the computational 
basis far more likely than bit flips. This paper 
addresses how noise bias can be exploited to 
improve the reliability of fault-tolerant quan- 
tum circuits. 

Specifically, we have in mind a setting in 
which, to an excellent approximation, the 
computational basis states {|0),|1)} are the 
energy eigenstates for an unperturbed qubit, 
and single-qubit gates which are diagonal in 
this basis can be performed by adiabatically 
adjusting the energy splittings between these 
states. Similarly, diagonal two-qubit gates are 
performed by adjusting the energy splittings 
of the four (approximate) energy eigenstates 
{|00),|01),|10),|11)}. Various noise sources 
may induce fluctuations in these energy spac- 
ings, and pulse imperfections may cause gates 
to be over-rotated or under-rotated; in either 
case the noisy gate deviates from the ideal 
gate, but remains diagonal. Other physical 
processes may induce transitions between en- 



ergy eigenstates, but we will assume that this 
non-diagonal noise is far weaker than the dom- 
inant diagonal noise. 

Our scheme for fighting biased noise is based 
on asymmetric Bacon-Shor codes [3H5] . These 
quantum codes combine together a length- 
m repetition code protecting against dephas- 
ing and a length-n repetition code protect- 
ing against bit flips; by "asymmetric" we 
mean m > n, so the code works more ef- 
fectively against dephasing than against bit 
flips. A complete universal set of fault-tolerant 
gates can be constructed using only diagonal 
gates, plus single-qubit measurements in the 
X eigenstate basis {|±) = ^ (|0) ± |1))} and 
preparation of single-qubit |+) states. 

In most previous studies of quantum fault- 
tolerance based on Bacon-Shor codes, these 
codes have been concatenated to build a hier- 
archy of codes within codes [HI E] ■ Here we will 
mainly study the performance of a single large 
block code rather than a concatenated scheme. 
For the Bacon-Shor code family, used without 
concatenation, there is no accuracy threshold; 
rather, for a fixed value of the noise strength 
and noise bias, there is an optimal block size 
which achieves the best performance. 

Our gadget constructions and analysis ex- 
tend the results in [SJ [5] , where the case n = 1 
was regarded as the bottom layer of a concate- 
nated code. In contrast to JHUH], we consider 
the case where the gates are required to be ge- 
ometrically local in a two-dimensional array. 
In the geometrically local case, logical blocks 
can be transported as needed via teleporta- 
tion, with the fundamental operations still lim- 
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ited to diagonal gates, X measurements, and 
|+) preparations. 

We have obtained upper bounds on the op- 
timal performance and overhead cost of log- 
ical gates by deriving analytic formulas, and 
our estimates are far from tight. We expect 
that significantly better results could be ob- 
tained using numerical simulations assuming 
independent biased noise, which we have not 
attempted. In particular, our analysis includes 
quite conservative estimates of the probability 
of failure for preparations of cat states that are 
used for measurements of logical Pauli opera- 
tors. 

Another particularly attractive approach to 
achieving quantum fault tolerance with ge- 
ometrically local gates is based on topolog- 
ical codes [TUHT2"] . which unlike Bacon-Shor 
codes have an accuracy threshold and hence 
can reach arbitrarily low error rates per logi- 
cal gate. Topological codes can be adapted for 
optimized protection against biased noise, and 
they perform well against biased noise even 
without any such adaptation. Our current re- 
sults do not conclusively identify noise param- 
eter regimes for which Bacon-Shor codes are 
clearly superior to topological codes. However, 
our scheme has several appealing features - 
for example, the optimal error rate per log- 
ical gate is achieved with a relatively mod- 
est number of physical qubits per code block, 
and is not too adversely affected when qubit 
measurements are noisier than quantum gates. 
Furthermore, only relatively modest classical 
computational resources are needed to inter- 
pret error syndromes. 

We review Bacon-Shor codes in Sec. II, de- 
scribe our biased noise model in Sec. Ill, and 
construct our fault-tolerant gadgets in Sec. 
IV. We obtain upper bounds on the perfor- 
mance of our fault-tolerant logical CNOT gate 
in Sec. V, repeat the analysis in Sec. VI for 
the case where two-qubit gates are required to 
be geometrically local, and report numerical 
values in Sec. VII. In Sec. VIII and IX we 
discuss the state injection and state distilla- 
tion procedures needed to complete our fault- 
tolerant universal gate set. 

Fault-tolerant gadgets protecting against bi- 
ased noise have also been discussed previously 
in [TMES- 



II. BACON-SHOR CODES 

Bacon-Shor codes are quantum subsystem 
codes which are constructed by combining 
together two quantum repetition codes, one 
protecting against Z (phase) errors and the 
other protecting against X (bit flip) errors. 
We will consider using Bacon-Shor codes to 
protect quantum information against biased 
noise, such that Z errors are much more com- 
mon than X errors; therefore, the length m 
of the code protecting against Z errors will be 
longer than the length n of the code protecting 
against X errors. In this case we say that the 
code is asymmetric. The code protects one en- 
coded qubit in a block of nm physical qubits. 

It is convenient to arrange the physical 
qubits in a rectangular lattice with n rows and 
m columns. The code has (m — 1) Z-type sta- 
bilizer generators, or check operators, which 
can be measured to determined a syndrome 
for detecting X errors, and n — 1 A-type sta- 
bilizer generators, which can be measured to 
determine the syndrome for Z errors. Each 
Z-type check operator is a weight-2m opera- 
tor applying Z to each qubit in a pair of adja- 
cent rows, and each A-type check operator is a 
weight-2n operator applying X to each qubit 
in a pair of adjacent columns. Because each 
Z-type check operator "collides" with each A- 
type check operator at four lattice sites, the 
Z-type and A-type check operators are mutu- 
ally commuting and can be measured simulta- 
neously. The logical Pauli operator Z L acting 
on the encoded qubit is a tensor product Z® m 
acting on all the qubits in a (long) row; all rows 
are equivalent because a product of two such 
row operators is in the code stabilizer. The op- 
erator Z L collides with each A-type check op- 
erator at two sites; therefore it commutes with 
these check operators and hence preserves the 
code space. The logical Pauli operator X L is 
a tensor product X® n acting on all qubits in a 
(short) column; again the columns are equiva- 
lent, and X L commutes with the Z-type check 
operators. The operators Z L and X L collide 
at a single site and hence anticommute. 

Assuming that m and n are both odd, the 
code can correct (m — l)/2 Z errors and 
(n — l)/2 A errors. To perform a decoded X L 
measurement, we can measure all nm qubits 
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FIG. 1: Bacon-Shor qubits arranged in an n x m 
lattice. 



in the X basis, compute the parity of the out- 
comes for each of the m columns, and then 
perform a majority vote on the m column par- 
ities. The decoded measurement fails only if 
(m+l)/2 columns each have at least one Z er- 
rors. Similarly, we can perform a decoded Z L 
measurement by measuring all qubits in the Z 
basis, computing the parity of the outcomes 
for each of the n rows, and then performing a 
majority vote on the n row parities. This de- 
coded measurement fails only if (n + 1)/2 rows 
each have at least one X error. 

The code also has a gauge algebra of Pauli 
operators that commute with the check opera- 
tors and with the logical operators, though the 
gauge operators do not necessarily commute 
with one another. The Z-type gauge algebra 
is generated by weight-two operators with Z 
acting on a pair of neighboring qubits in the 
same column, and the X-type gauge algebra 
is generated by weight-two operators with X 
acting on a pair of neighboring qubits in the 
same row. Thus each X-type check operator 
is a product of n weight-two gauge operators, 
and each Z-type check operator is a product of 
m weight-two gauge operators. Conveniently, 
then, the error syndrome can be determined by 
measuring only weight-two gauge operators, 
and furthermore each of these gauge operators 
is geometrically local, acting on a pair of neigh- 
boring qubits. Measuring an X-type gauge 
operator disturbs the values of some Z-type 
gauge operators (and vice versa) , but without 
inflicting any damage on the protected logical 
subsystem. 
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FIG. 2: X-type stabilizer operator (top) and Z- 
type stabilizer operator (bottom) 



III. NOISE MODEL 

We consider a local stochastic biased noise 
model, in which we distinguish between a rate 
e for dephasing faults and a (perhaps much 
lower) rate e' for general faults in diagonal 
gates. The ratio R = e/e' will be called the 
noise bias. For simplicity we will assume that 
e is also the error rate for single-qubit prepa- 
rations and measurements in most of the for- 
mulas we display, though we will treat the 
case where preparations and measurements are 
noisier than diagonal gates in Sec. |VII Cj A de- 
phasing fault, which we will also call a diagonal 
fault, is a trace-preserving completely positive 
map such that all Kraus operators are diagonal 
in the computational basis; for a general fault, 
which we will also call a non-diagonal fault, 
the Kraus operators are unrestricted. We as- 
sume that the sum of the probabilities for all 
fault paths with r dephasing faults and s non- 
dephasing faults at specified circuit locations 
is no greater than e r {e') s . 



IV. FAULT-TOLERANT GADGETS 

The construction of fault-tolerant gadgets 
for universal quantum computation follows 
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Ref. [5]. We implement the logical gate set 

Scss = {P| i +) ,P| L 0>) Mi,MiCNOT i } (1) 

using the physical gate set 

{V l+) ,M X ,CZ}; (2) 

here M.x, M-z denote measurements of the 
Pauli operators X and Z, V\+\, V\q\ denote 
preparations of the X and Z eigenstates with 
eigenvalue 1, and CZ denotes the two-qubit 
controlled phase gate 

CZ = exp(-i|(Z-/)® (Z- /)) , (3) 

which is diagonal in the computational 
(Z-cigenstate) basis |00>, ]01>, jlO), jll) with 
eigenvalues (1,1,1,-1). A diagonal gate like 
this one can be realized by adiabatically per- 
turbing the energy splittings of the computa- 
tional basis states, and hence could plausibly 
have highly biased noise. 

The CZ gate acts symmetrically on the two 
qubits, and, like any Clifford gate, it can be 
usefully characterized by how it propagates 
Pauli errors: 

CZ : XI -> XZ, IX ZX, 

ZI -> ZI, IZ IZ. (4) 

Of course, CZ commutes with Z acting on ei- 
ther qubit, and propagates an X acting on ei- 
ther qubit to a Z error acting on the other. 
These properties are nice if Z errors dominate 
over X errors and we use a code that corrects 
Z errors more effectively than X errors. The 
CZ gates do not propagate the relatively com- 
mon Z errors, and though X errors do propa- 
gate they are transformed to Z errors, which 
the code handles more readily. 

To achieve universal quantum computa- 
tion, we supplement the logical gate set 
Gqss w ith high-fidelity encoded states \i) = 
^(|0>+i|l» and \T) = ^(| )+e"/ 4 |l)), 
which can be prepared using state injection 
and distilled using the Gcss operations [16]. 
Gates that complete a universal set can then 
be constructed using these states and the Gqss 
gates [I?] . 



A. Mx gadget. 

To perform a destructive M x measurement 
we first perform independent Mx measure- 
ments on all qubits in the code block, and then 
compute the parity of the n measurement out- 
comes in each column. Finally a majority vote 
of the to column parities yields the logical mea- 
surement outcome. The result agrees with an 
ideal measurement if the sum of the number of 
Z errors in the block and the number of faulty 
M.x measurements is no larger than (m— 1)/2. 

B. M z gadget. 

A non-destructive Ai^ measurement 
may be performed by measuring (non- 
destructively) the Z-basis parity of each 
length-m row of the code block, and taking a 
majority vote of the n row outcomes. (We say 
the measurement is "non-destructive" if the 
ideal measurement procedure leaves encoded 
Z L eigenstates intact.) In principle the row 
parity measurement could be done by prepar- 
ing a single ancilla qubit in the state |+), 
performing to CZ gates in succession acting 
on the ancilla qubit and the to qubits in the 
row, and finally measuring the ancilla qubit in 
the X basis. However, this procedure is not 
fault-tolerant for two reasons. First, a single 
Z error acting on the ancilla qubit can flip 
the measurement outcome. Second, a single 
X error acting on the ancilla can propagate 
multiple times, producing a high-weight Z 
error acting on the qubits in the row. 

This second problem can be addressed by 
replacing the single-qubit ancilla by an to- 

qubit cat state |+ cat ) = -±= (\of m + |l)® m ), 

so that each ancilla qubit interacts via a CZ 
gate with only one data qubit, limiting the er- 
ror propagation. The fault-tolerant prepara- 
tion of the cat state is discussed below. After 
the to CZ gates, the cat state is read out in 

the basis |± cat ) = i (\0f m ± |l)® m ) to de- 

termine the row parity. This measurement of 
X® m is performed destructively by measuring 
each of the to ancilla qubits in the X basis and 
computing the parity of the outcomes. 

To address the first problem (that a single 
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fault can flip the measured parity), the par- 
ity measurement is repeated r times for each 
row, each time with a fresh cat state, and the 
majority of the results is computed. The rep- 
etition provides protection against Z errors in 
the ancilla, but of course there may also be 
(relatively rare) X errors acting on the data 
qubits due to faults during the Z L measure- 
ment circuit. These faults cause a logical X L 
error only if X errors occur in a majority of 
the n rows. 

Actually, we find that, depending on the 
rates for diagonal and non-diagonal faults, it 
may be advantageous to use a shorter ancilla 
prepared in a cat state of length p, where 
1 < p < m. In that case some ancilla qubits in- 
teract with more than one data qubit, increas- 
ing the danger of error propagation, but on the 
other hand errors are less likely to occur dur- 
ing the preparation of the (shorter) cat state. 
However, using a length-m cat state has the 
significant advantage that the Ai^ measure- 
ment can be conveniently executed using geo- 
metrically local gates on the two-dimensional 
lattice, as we will discuss in Sec. fvT 



as we will discuss in Sec 
This procedure for measuring Z 



can easily 

be extended to build gadgets that perform the 
parity measurements M% z , -Mzzz 011 multi- 
ple logical qubits. 



C. Cat state preparation 

The length-p cat state |+ cat ) = 

^ (|0)® p + |I)® P ) may be characterized 

as the simultaneous eigenstate with eigen- 
value 1 of X® p and of p — 1 ZZ operators, 
each one acting on a pair of neighboring 
qubits. To prepare the cat state, we start 
by preparing the product state \+)® p (an 
eigenstate of X® p ), and proceed to mea- 
sure the p — I ZZ operators (each of these 
measurements is executed using yet another 
ancilla qubit). If every measurement has 
the outcome ZZ = +1, then the cat state 
has been prepared successfully, assuming the 
measurements are flawless. Otherwise, the 
measurement outcomes provide a syndrome 
pointing to X errors in the cat state (each 
ZZ = — I outcome identifies the endpoint of 



a chain of X errors). These X errors in the 
cat state need not be corrected; instead we 
can keep track of their propagation as the 
computation proceeds. Specifically, the X 
errors in the cat state propagate to become Z 
errors acting on the measured row of the data 
block, which should be taken into account 
when we decode a later X L measurement of 
that block. Following Knill [T7j, we say that 
the measured syndrome is used to update the 
"Pauli frame" of the computation. 

To improve robustness, we add one more 
(redundant) ZZ measurement acting on the 
first and last qubit in the cat state. Then 
an even number among the p ZZ measure- 
ments should have the outcome —I if all mea- 
surements are correct; if in fact we find an 
odd number of —1 outcomes than we reject 
the syndrome. (Adding further redundant 
checks would further improve the reliability 
of the syndrome extraction, though we will 
not include these.) We conduct r' rounds 
of syndrome measurement, including rejected 
rounds, so that the number of accepted rounds 
may be fewer than r' . 

In principle, the cat state Pauli frame could 
be determined by performing a perfect match- 
ing on a two-dimensional graph representing 
the space-time history of the cat state syn- 
drome measurement [10] . Instead, we will con- 
sider a much simpler and less effective method 
for decoding the syndrome history. Though far 
from optimal, our procedure has two advan- 
tages over the perfect matching algorithm — it 
requires only very modest (classical) computa- 
tional resources compared to perfect matching, 
and it is also much easier to study analytically. 

In our unsophisticated scheme, we deter- 
mine the Pauli frame using whichever syn- 
drome occurs most frequently among the ac- 
cepted rounds, which we call the "winning" 
syndrome. (Our analysis does not depend 
on what rule is used when two or more syn- 
dromes tie for the distinction of being most 
frequent, since we pessimistically assume that 
the preparation gadget fails catastrophically in 
the event of such a tie.) The state preparation 
protocol is deterministic in the sense that an- 
cillas are never discarded, and the number of 
rounds of syndrome measurement is always r' 
irrespective of the measurement outcomes. 
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FIG. 3: Gadget for a single row of the non-destructive M z measurement, where the cat state ancilla 
length p is equal to the row length m. 



Since the CZ gates commute with each 
other, we can use a trick to reduce the risk of 
undetected X errors in the ancilla propagat- 
ing to become Z errors in the data. Instead 
of preparing the cat state first and then using 
the cat state to measure the parity of a row, 
we perform the ZZ measurements on the cat 
state after the ancilla interacts with the data. 
The resulting complete circuit for measuring 
the row parity is shown in |Fig. 3) 

Although the cat state is "prepared" after 
its "use," X errors in the ancilla during the 
preparation can still cause trouble. The X er- 
rors cause the cat state syndrome to evolve as 
it is measured repeatedly, and therefore the 
syndrome that occurs most often, even if valid 
when measured, may differ from the syndrome 
that is applicable when the cat state interacts 
with the data. The effect of this syndrome 
"drift" needs to be included in our error anal- 



ysis. 



D. 



V\ +) gadget. 



The logical state |+) , for a particular 
"choice of gauge," is a product of m length- 
n cat states, one for each column of the block. 
This state is an eigenstate with eigenvalue 1 of 
X L = X® n acting on any column, and of the 
gauge operator ZZ acting on a pair of neigh- 
boring qubits in any column (and hence also 
a +1 eigenstate of all the Z-type check op- 
erators). We prepare these m length- n cat 
states in parallel, using the same procedure for 
preparing length-p cat states described above. 
In this case we repeat the syndrome measure- 
ment for each column r + times. 
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E. CNOT L gadget. 



The gadget shown in Fig. 4 applies a fault- 
tolerant "teleported" CNOT 1 ' logical opera- 
tion, up to a known Pauli error determined 
by the measurement outcomes [5] . Aside from 
executing the logical gate, the gadget also 
achieves error correction by measuring both 
incoming blocks and preparing fresh outgoing 
blocks. 

To better understand how the gadget works, 
it is helpful to divide it into two parts. One 
part is the "one-bit teleportation" gadget 
shown in |Fig. 5| which performs error cor- 
rection on a single block. This circuit trans- 
fers the single-qubit state \tp) from qubit 2 
to qubit 1. We prepare qubit 1 in the state 
|+) = ^7g(|0) + |1)), perform the joint mea- 
surement Mz 1 z 2 on the two qubits, then per- 
form the single-qubit measurement M X 2 - Af- 
ter applying Pauli operators conditioned on 
the two measurement outcomes, the result is 
that the input state |^>) of qubit 2 is moved 
to qubit 1. The conditional X and Z gates in 
this teleportation gadget need not actually be 
executed; instead the Pauli frame is updated 
to reflect the measurement outcomes. 

The other component of the CNOT gadget 
is shown in |Fig. 6| where again the Pauli frame 
updates are explicitly indicated. We may ver- 
ify that this circuit realizes a CNOT gate by 
checking that it propagates Pauli operators ac- 
cording to the rule 



CNOT : XI 
ZI 



XX, IX 
> ZI, IZ 



IX, 

ZZ. (5) 



We denote the measurement outcomes of Mx 
and Mzzz as (— l) a and (— l) b respectively. 
Since |+) is an eigenstate of X with eigen- 
value +1, XI acting on the input qubits is 
equivalent to XIX acting on the three qubits 
in the circuit, which commutes with Mzzz 
and hence becomes XX acting on the out- 
put qubits. Likewise, IX acting on the input 
qubits is equivalent to I XX, which commutes 
with Mzzz and becomes (— l) a IX acting on 
the output qubits using the measurement out- 
come I XI = (—1)". ZI acting on the in- 
put qubits commutes with Mzzz and trivially 
maps to ZI acting on the output qubits, while 



IZ acting on the input qubits becomes IZI, 
which maps to (-l) b ZIZ using the measure- 
ment outcome ZZZ = ( — l) 6 . Hence we have 
found 



CNOT : XI -> XX, IX -> (-l) a IX, 
ZI -> ZI, IZ -> {-l) b ZZ, 



(6) 



which is corrected to Eq.([5]) by applying Z a 
to both output qubits and X b to the second 
qubit. When we combine the CNOT gate 
with one-bit teleportation of the first qubit, 
the measurements Mzz and Mzzz commute 
and hence can be done in either order. 

Four different versions of the CNOT gadget 
are shown in |Fig. 4| Let us denote the blocks 
appearing in the figure as blocks 1, 2, 3, 4, 
counting from the top down. Version (a) of 
the CNOT gadget contains the preparations 
and measurements "P| + )[l], 7 ? i + \[4], .Mzz [12], 
M Z zz[2M], Mx[2], Mx[3]. This gadget 
has some arbitrary features which are ex- 
ploited to construct the other versions. Since 
ZZ[Y1]ZZZ[IM] = ZZZ[2M], we extract the 
same parity information if .Mzzz [234] is re- 
placed by M zzz [134]; also, since Mzz acts 
symmetrically on its two measured blocks, we 
may permute the blocks, in effect replacing 
V\ +) [1] and M x [2] by V\ +) [2] and M x [1] , thus 
obtaining version (b). Similarly, since Mzzz 
acts symmetrically, we may replace "P\+) [4] 
and M X [S\ by 7>|+>[3] and M x [4\, thus ob- 
taining versions (c) and (d). In all these re- 
alizations of the gadget, the upper incoming 
block is the control block and the lower in- 
coming block is the target block; by replac- 
ing Mzz[l2] and Mzzz[234] by M Z zz[^3} 
and .Mzz [34] we can reverse the order of the 
CNOT. 

This freedom to choose the incoming control 
block to be either block 1 or block 2, and to 
choose the incoming target block to be either 
block 3 or block 4, will be useful when we con- 
sider in Sec. |VI| how to realize the CNOT gad- 
get using geometrically local gates. No matter 
how we configure the gadget, one block par- 
ticipates in both Mzz and Mzzz - However, 
for versions (a) and (c) in Fig. 4 the twice 



measured block is an incoming (control) data 
block, while for versions (6) and (d) the twice 
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FIG. 4: Four versions of the CNOT gadget. 
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FIG. 5: One-bit teleportation circuit. 
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FIG. 6: Teleported CNOT gate. 



measured block is an outgoing (control) data 
block. 

The CNOT L gadget is characterized by four 
parameters: the length p of the cat states used 
in the Z-type measurements and the num- 
ber of times measurements are repeated (r+ 
for the |+) L preparation, r' for the cat state 
preparation, and r for ZZ L and ZZZ L mea- 
surements). In principle we could choose dif- 
ferent values of these parameters in the mea- 
surements M. zz and M. zzz , but in practice 
we have found it optimal or near optimal to 
use the same parameters for both. 



F. Scheduling 

If geometrically nonlocal gates are allowed, 
and fresh ancilla qubits are always available 
on demand, then we can schedule the CZ 
gates so that no data qubit or ancilla qubit 
is idle during any time step. We assume that 
a preparation, measurement, or CZ gate can 
be performed in a single time step. Suppose, 
for example, that we use the gadget design 



Fig. 



where the outgoing control data 
block participates in both and M^ zz , 

and let us also assume that the [12] cat states 
have length 2m while the [234] cat states have 
length 3m, matching the weight of the mea- 
sured operators. We label the r ancilla regis- 
ters used to measure a row operator r times by 
a E {1,2,3, ... ,r} for the ZZ L measurement 
and by b G {1, 2, 3, ... , r} for the ZZZ L mea- 
surement. Suppose that Vy^ [2] is completed 

in time step -1 and that 'Py^ [3] is completed in 
time step 0. Then the CZ gates are scheduled 
as follows: The [12] ancilla a interacts with 
block 2 in time step a — 1 and with block 1 in 
time step a, while the [234] ancilla b interacts 
with blocks 2, 3, and 4 in time state b. Hence 
processing of the incoming blocks (1 and 4) 
begins in time step 1, and processing of the 
outgoing blocks (2 and 3) ends in time step r. 

The "preparation" of the [234] ancilla b be- 
gins in time step 6+1, but the "preparation" 
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of the [12] ancilla a is "staggered." The syn- 
drome measurement for the first m qubits in 
ancilla a (those that interact with block 1) be- 
gins in time step a + 1, while the syndrome 
measurement for the last m qubits (those that 
interact with block 2) begins one step earlier 
(step a). Despite this one time step delay, no 
qubits are ever idle — the ancilla qubit used 
to measure syndrome bit Z[m\Z[m + 1] in cat 
state a interacts with cat state qubit m + 1 
in time step a and interacts with cat state 
qubit m in time step a + 1. Meanwhile, the 
ancilla qubit used to measure the syndrome 
bit Z[l]Z[2m] in cat state a interacts with cat 
state qubit 2m in time step a + 1 and with cat 
state qubit 1 in time step a + 2, etc. 

If instead we use the CNOT gadget design in 



Fig. 4 1, where the incoming control data block 
participates in both Ai Z z anc ^ M zzz , the 
processing of the data blocks is not perfectly 
synchronized. If M.\ z is performed first, then 
the incoming and outgoing control blocks are 
one time step ahead of the incoming and out- 
going target blocks, while if M.\ zz is per- 
formed first it is the other way around. Thus 
if the incoming control block is either one step 
ahead or one step behind the incoming tar- 
get block, we can perform the CNOT without 
leaving any qubits idle, while maintaining the 
one-step lag between the blocks. But if the two 
incoming blocks are synchronized, then using 
the gadget in |Fig. 4fc one block or the other 
would have to wait for one time step before 
the processing of the block begins. 



G. Injection by teleportation 

To complete a universal set of protected 
gates, we will use the magic state distillation 
procedure introduced by Bravyi and Kitaev 
[16]. First we inject non-Clifford states into 
the code block using teleportation, then we im- 
prove the fidelity of these encoded states via 
distillation. The distillation protocol is dis- 
cussed further in Sec. 



IX here we briefly de- 



scribe the state injection procedure, with fur- 
ther details postponed until Sec. |VIII| 

The state injection makes use of the "one- 
bit teleportation" circuit depicted in |Fig. 5| To 
inject a state into the Bacon-Shor block, first 



the state \tp) is prepared using a single physi- 
cal qubit, while a logical block is prepared in 
the state Then the weight-(m + 1) oper- 

ator Z L Z is measured on the single qubit and 
a single row of the block, and finally the single 
qubit is measured in the X basis. This pro- 
cedure prepares the block in the state \ip) L , 
up to a logical Pauli operator known from the 
measurement outcomes. 

There are other possible injection proce- 
dures, but this one is particularly simple. A 
single fault in the circuit can cause failure. 



V. EFFECTIVE ERROR STRENGTH 

The CNOT gadget may produce a logical 
error if any one of its four measurements de- 
viates from the ideal case. To estimate the 
failure probability for the CNOT gadget, we 
must enumerate the ways in which these mea- 
surements might fail. 

One potential source of trouble is the prepa- 
ration of cat states and the logical |+) L states. 
Recall that in these preparation gadgets a 
syndrome is measured repeatedly, and that 
the "winning" (i.e., most frequently occur- 
ring) syndrome measurement result is used to 
update the Pauli frame. This winning syn- 
drome might differ from the syndrome that 
would have been inferred in an ideal gadget 
because of repeated Z errors that induce er- 
rors in the syndrome measurement, because of 
X errors that cause the syndrome to evolve be- 
tween rounds, or because of some combination 
of the two. We say that the preparation gadget 
"succeeds" if the winning syndrome matches 
the syndrome that could have been obtained 
by an ideal measurement at some stage dur- 
ing the multiple rounds of syndrome measure- 
ment, and if furthermore fewer than half of 
the cat state qubits are afflicted by X errors 
during the preparation process. Otherwise the 
preparation gadget "fails." 

Failure of a preparation gadget can be catas- 
trophic, because the inferred Pauli frame may 
differ from the actual Pauli frame by a high- 
weight X-type error. If it occurs in a cat state, 
this high-weight X error could propagate to 
the data, causing a logical Z L error. If it oc- 
curs in the preparation of a logical block, the 
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error could flip the value of a subsequent Z- 
type logical measurement. 

On the other hand, even if a cat state prepa- 
ration succeeds, the winning syndrome might 
not coincide perfectly with the ideal syndrome 
that applies to the cat state at the time it 
interacts with the data. The syndrome may 
drift due to X errors that accumulate during 
the many rounds of syndrome measurement. 
However, the inferred Pauli frame differs from 
the ideal Pauli frame by a number of X er- 
rors that is no larger than the number of non- 
diagonal faults that occur inside the prepara- 
tion gadget. Though the inferred Pauli frame 
may not be exactly right, it is not catastroph- 
ically wrong unless there are many X errors in 
the circuit. 

Aside from errors in the measured cat state 
syndrome, a M. zz or M. zzz could be faulty 
because of Z errors acting on the cat state 
qubits, which flip the outcomes of row mea- 
surements. Because each row measurement is 
repeated r times, multiple errors are required 
for a row measurement to fail. Alternatively, 
an X error acting on a data qubit could flip 
the outcome of a row measurement. Because 
a majority vote is performed on n row mea- 
surement outcome, again multiple errors are 
required for the logical Z-type measurement 
to fail. An Ai x could be faulty because of Z 
errors on data qubits, which flip outcomes of 



subsequent X measurements on those qubits. 
Because a majority vote is performed on the 
m column parities, multiple errors are required 
for the M x to fail. 

In estimating the failure probability for 
A4 X , M zz , or M zzz , we need to take into 
account possible faults in circuit locations pre- 
ceding the CNOT gadget, which might cause 
these measurements to deviate from their ideal 
outcomes. In our fault-tolerant scheme, a logi- 
cal CNOT gate is either preceded by teleporta- 
tion into the code block or by another CNOT 
gate; in the worst case, each input block to the 
CNOT gadget is an output block from an im- 
mediately preceding CNOT gadget. Thus, in 
the preceding gadget, each block is prepared 
in the state |+) L , and then subjected to either 
M-zz an d M zzz (if it is the control block of 
the CNOT gadget) or just M zzz (if it is the 
target block of the CNOT gadget). 

A. Measurement failure 



We will assume that failure of any of its 
preparation gadgets will cause the CNOT to 
fail, and we denote by P* TI the failure proba- 
bility for a logical measurement due to a cause 
other than failure of a preparation gadget. 
Thus, the failure probability for the CNOT 
can be bounded as 



P crr (CNOT i ) < P;„{M L ZZ ) + P;„(M L ZZZ ) + 2P C UM L X ) 

+ 4P C rr('P|+>) + -Pcrr("P|ZZ-Cat)) + ^orr^l^ZZ-Cat) ) 

+ 2 (^Pcrr{'P\ZZ-Ca,t)) + ^crr (P\ ZZZ-C&t) )) ■ (?) 



The last term accounts for preparation er- 
rors that may have occurred in a M. Z z 01 
M. Z zz immediately preceding the current 
CNOT gadget, acting on either one of the two 
input blocks to the current gadget. In the 
worst case, both incoming blocks were con- 
trol blocks in the preceding CNOT gadgets, 
and were therefore subjected to both M. Z z 
and M. zzz . The coefficient of P orr ('P| + )) is 4 



rather than 2 for a similar reason — the logical 
measurement outcomes in the current CNOT 
gadget may differ from ideal outcomes due to 
errors in the preparation of logical blocks ei- 
ther in the current gadget or in one of the two 
preceding gadgets. 

Before considering the probability of a 
preparation failure, let us estimate P* rl for 
each of the three types of logical measure- 
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merits. For an M.\ measurement to fail, more 
than half of the m length-n columns must 
each have at least one Z error. In the worst 
case, each qubit in the block participates in 
3r + 2r + + 2 circuit locations: a preparation, a 
measurement, 2r + CZ gates contained in the 
preparation of the logical \+) L state, r CZ 
gates in the logical Z measurement in the cur- 
rent gadget, and 2r CZ gates in two logical Z 
measurements in the preceding gadget. Each 
of these locations could be faulty with prob- 
ability e + e', since either a diagonal of non- 
diagonal fault could produce a phase error. 

For the moment, let us assume for simplicity 
that the cat state has length 2m for the ZZ L 
measurement and length 3m for the ZZZ L 
measurement, so that no ancilla qubit inter- 
acts with more than one data qubit. (We 
will consider the case where the cat state is 
Because the preparation 



shorter in Sec. VC 



of the cat state actually occurs after the an- 
cilla interacts with the data, we do not need to 
worry about X errors arising in the cat state 
preparation propagating directly to the data. 
We do need to worry that the Pauli frame in- 
ferred from the "winning" syndrome after r' 
rounds of syndrome measurement actually dif- 



fers from the ideal Pauli frame. However, as- 
suming that the syndrome is decoded correctly 
(as it will be if the preparation succeeds), a 
Pauli frame X error afflicts a qubit only if at 
least one X error acted on that qubit during 
the multi-round syndrome measurement. The 
probability of such a Pauli-frame X error on 
any qubit in the cat state is therefore bounded 
above by 2r's' , since two CZ gates act on each 
qubit in each round of syndrome measurement, 
and the syndrome measurement is repeated r' 
times in the cat state preparation. Because 
each row measurement is repeated r times, the 
total probability that a row measurement fails 
due to a Pauli-frame X error in a cat state is 
bounded above by 2rr'e'. Furthermore, in the 
worst case there are three logical Z L measure- 
ments between the preparation and measure- 
ment of the block, in each of which a cat state 
interacts with the data. 

Combining together the probability per 
qubit of a Z error caused directly by a fault 
with the probability of a Z error that propa- 
gates from the cat state, we conclude that the 
error probabilities for the X L measurements 
can be bounded as 



KAM L X ) < [n(2r + + 3r + 2){e + e>) + 6nrrV] (m+1)/2 . (8) 



As explained in Sec. |IV F| , there are no time 
steps in which "resting" qubits are subject to 
"storage errors." 

For the M zz to fail, more than half of the 
n weight-2m row parity measurements must 
have errors. An X error acting on any one 
of the 2m qubits in the same row of the two 
blocks, either before or during the measure- 
ment, could flip the parity of the row. Be- 
fore the M zz is completed, each qubit par- 
ticipates in 2r + CZ gates during the V\+) op- 
eration, r CZ gates during the M zz , and in 
the worst case another 2r CZ gates during the 
M^ zz and M zz in the preceding CNOT gad- 
get. Thus the probability per qubit of a non- 



diagonal error is (2r + + 3r)e'. There might 
also be an X-type Pauli frame error in the 
\+) L preparation, but we do not have to count 
that separately; if the preparation "succeeds," 
then a Pauli-frame X error afflicts a qubit in 
the block only if a non-diagonal fault damaged 
that same qubit during the preparation circuit. 

In the absence of non-diagonal faults, the 
row parity measurement might still fail be- 
cause at least one Z error acts on the cat state 
in each of (r + l)/2 of the repeated measure- 
ments. If the cat state has length p, then in 
each repetition of the measurement there are 
2m + 2p + 2r'p circuit locations (2m CZ gates 
coupling the ancilla to the data, plus p qubit 
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preparations, p qubit measurements and 2pr' 
CZ gates for the cat state preparation). We 



conclude that the failure probability can be 
bounded as 



V 2 

Similarly, 



2m(2r+ + 3r)e' + ( r±1 ) [(2m + 2p + 2pr')(e + e') 



Ml(r+l)/2 



(n+l)/2 



(9) 



KJMzzz) < [ n+1 
2 



3m(2r+ + 4r)e' + ( P +i ) K 3m + 2 P + 2 P r ')( £ + e')] (r+1)/2 



(n+l)/2 



(10) 



(The Mzzz 1S preceded by a M% z acting 
on one of its input blocks inside the current 
CNOT gadget, plus additional Z-type logical 
measurements in the preceding CNOT gadgets 
acting on another one of its input blocks.) 



B. Cat state preparation failure 

Now it remains to estimate the probabil- 
ity of failure for the cat state and the logi- 
cal state \+) L - Recall that the preparation 
fails if the syndrome measurement is faulty 
in every round that yields the "winning" syn- 
drome outcome. The faults can be either non- 
diagonal or diagonal, but there must be at 
least two faults; otherwise only one syndrome 
bit would be wrong, the result would fail the 
parity test, and the syndrome would be re- 
jected. (A round with a single non-diagonal 
fault might be accepted if the fault flips a syn- 
drome bit while also simultaneously applying 
an X error to one of the cat state qubits, which 
flips a second syndrome bit measured in the 
following step. However, in this case the mea- 
sured syndrome matches the actual cat state 
syndrome at the end of the round, and there- 
fore is not counted as a failure.) 

Suppose that the winning syndrome occurs 
t times, and that there are u additional rounds 
that each contain at least one fault (some of 
these rounds might be rejected). Suppose that 
non-diagonal faults occur in s of the r' rounds 
of syndrome measurement; these faults can al- 



ter the syndrome. There are r' — t — u rounds 
without any faults, and the number of distinct 
syndromes detected in these rounds is at most 

8+1. 

Now we can use the pigeonhole principle to 
obtain a lower bound on t, expressed in terms 
of r' , u, and s. There are at most s + 1 valid 
syndromes that can occur in syndrome mea- 
surement rounds that have no faults. Com- 
bining these with the winning syndrome, there 
are at most s + 2 possible syndromes that can 
occur in the r' — u accepted rounds which ei- 
ther have no faults or produce the winning out- 
come. Of these s + 2 syndromes, the winning 
syndrome must occur at least as many times 
as any other syndrome; hence 



t > 



(11) 



where \x\ denotes the smallest integer greater 
than or equal to x. 

To bound ferrO^lcat})) we sum over s an( l u 
in each cat state preparation step, estimating 
the number of possible fault histories using the 
upper bound Eq. (Ill on i. In the first of the 



t winning rounds, a particular winning syn- 
drome is found, which differs in at least two 
bits from the actual syndrome in the begin- 
ning of that round. Then this same syndrome 
is found again in each of the remaining winning 
rounds. The sum over all possible winning 
syndromes, weighted by their probabilities, is 
bounded above by the probability that at least 
two measured syndrome bits are faulty in the 
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first of the t winning rounds. Each ZZ mea- 
surement is performed using one |+) prepara- 
tion, two CZ gates, and one X measurement; 
therefore the probability of error in the mea- 
surement of a single syndrome bit is bounded 
above by 4e + 2e' , and the probability that at 
least two syndrome bits are in error is bounded 
above by g)(4e + 2e') 2 - 

In each of the s rounds that contain X er- 
rors, we must sum over all the possible X- 
error patterns that can occur in that round. 
The sum over all X-error patterns, weighted 
by the probabilities, is bounded above by the 
probability that at least one nondiagonal fault 
occurs in that round. Since the round con- 
tains 2p CZ gates, this probability is in turn 
bounded above by 2pe' . 

Once the X-error pattern has been chosen 



in each round that contains X errors, we know 
the actual syndrome at the beginning of each 
round. And once the winning syndrome is cho- 
sen in the first winning round, we know which 
syndrome bits must have errors in each of the 
remaining t — 1 winning rounds. If the cat 
state preparation fails, then by definition at 
least two syndrome bits have errors in each 
of these rounds; hence each winning round af- 
ter the first has a probability weight bounded 
above by (4e + 2e') 2 . 

Taking into account that the rounds with 
non-diagonal faults can be chosen in at most 
( r ) ways, and enumerating the ways to choose 
which t rounds produce the winning syndrome 
and which u additional rounds have faults, we 
obtain 



Pcr^cat)) < -EEC"' J f U ! t VoV^ + 2 £ ') 2t (4^ + 2 P e') U (2p £ ') i , < L2) 



and similarly 

PerrCP|+>)<mEE 



r + r + 



s J \u + t J \ u 



s I \u + t J V u 



(4e + 2e') 2 *(4ne + 2ne') u {2ns') s , (13) 



where t denotes r^jrrf 1 ]. The prefactor nr in 



Eq. (12) arises because we use n length-p cat 



states in each measurement, and each mea- 
surement is repeated r times. The prefactor 
m in Eq. ( 13 ) arises because the encoded state 
\+) L is a product of m length- n cat states. 

We should add another contribution to the 
failure probability, because if half of the qubits 
(or more) have X errors, we might decode the 
cat-state syndrome incorrectly. A syndrome 
of the repetition code points to two possible 
X error patterns, one low weight and one high 
weight. We always assume the low- weight in- 
terpretation is correct, so if the high- weight 
interpretation is actually correct, then a Pauli- 
frame error X®p is applied to the cat state (or 
worse if the cat state qubits are used multiple 
times). The additional contribution, then, is 



bounded by the probability that each of \p/2\ 
qubits in the cat state are each hit by X er- 
rors at least once in at least one of the cat 
state preparations. This upper bound is 



P 

b/21 



(2rV) 



fp/21 



(14) 



The Kraus operator arising from a general 
non-diagonal fault could be a coherent linear 
combination of Pauli operators; therefore the 
pattern of X errors acting on the cat state may 
be a coherent superposition of many possibil- 
ity. For convenience and clarity, we assumed 
in the above discussion that non-diagonal er- 
rors are actually Pauli errors, so that X-error 
patterns can be assigned probabilities rather 
than amplitudes. But the argument also ap- 
plies to more general non-diagonal faults. - 
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by expanding each Kraus operator in terms 
of Pauli operators, and noting that all syn- 
drome bits are measured in the X basis, we 
can bound the probability for each syndrome 
measurement outcome in terms of e and e' as 
above. 



C. Shorter ancillas 

Cat states are used in measurements of 
the weight-2m logical operator ZZ L and the 
weight-3m logical operator ZZZ L . In the 
derivation of Eq. (|8| we assumed that the cat 
states have length 2m and 3m respectively, so 
that each of the cat state's qubits participates 
in just one CZ gate during the measurement. 
But we have found that the CNOT gadget may 
be more reliable if a shorter cat state is used 
instead. 

If the length of the cat state is at least m, 
then each cat-state qubit interacts with only 
one qubit in each encoded block. But if the 
length is p < m, then a single cat-state qubit 
may interact with as many as 

R = \m/p\ (15) 

data qubits in a single block, and hence an X 
error on that cat-state qubit may propagate 
to produce Z errors in R distinct columns of 
one block. If k of the cat state's p qubits have 
X errors, these could generate Z errors in kR 
columns of the block; thus an X L measure- 
ment on the block could fail if there are 

L(k) = (m + l)/2-Rk (16) 

additional columns that each contain at least 
one Z error. 



In each measurement, each data block in- 
teracts with the ancilla for R consecutive time 
steps, where in each step at most p CZ gates 
act on the data block. The repeated measure- 
ments can be staggered as described in [5], to 
avoid time steps in which data qubits are idle. 

We use the index % g {1, 2, 3, . . . ,p} to label 
the positions of the p qubits in the cat state, 
and design the measurement circuits so that 
the cat-state qubit i interacts with the same 
set of data qubits in each one of the measured 
code blocks. Since the ZZ L measurement is 
performed on all n rows of each block, and 
also repeated r times, inside a M.^ z there are 
at most 2nrR CZ gates that act on a cat-state 
qubit at position i (not counting CZ gates in 
the cat state preparation step), any of which 
could have a non-diagonal fault. Similarly, in- 
side a M. Z zz there are at most 3nrR CZ gates 
that act on a cat-state qubit at position i. 

The M x inside a CNOT gadget is preceded 
by Mzzz or M-zz within the CNOT, and in 
the worst case by both Mzzz an d Mzz m the 
preceding gadget, so at most there are 8nrR 
CZ gates in these measurements where a non- 
diagonal fault could disturb cat-state qubit i. 
An X error acting on cat-state qubit i might 
also occur because of an error in the winning 
syndrome in one of the cat-state preparation 
steps, which can occur only if a faulty CZ 
gate acts on qubit i at least once during the 
r' repeated syndrome measurements. Noting 
that a damaged cat-state qubit in position % 
could cause Z errors in R columns of the en- 
coded block, and that k damaged qubits in the 
length-p cat state can be chosen in (?) ways, 
we conclude that if length-p cat states are used 
then Eq. qSJ) should be replaced by 



fen 



k=0 



r M K ) < ^ (f) [SnrRs' + &nrr'e'] k ( ™ J [n(2r + + 3r + 2)(e + e')f [k) , (17) 



where VI. GEOMETRICALLY LOCAL 

CIRCUITS 

k max =\(m + l)/2R]. (18) 

In our analysis so far, we have assumed that 
the two-qubit CZ gates can be performed on 
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any pair of qubits, with an error rate indepen- 
dent of the distance between the qubits. Now 
we will consider how the analysis is modified 
if CZ gates can be performed only on neigh- 
boring pairs of qubits. 

There are a variety of possible architec- 
tures for fault-tolerant quantum computing 
using Bacon-Shor codes. To be concrete, we 
will consider an effectively one-dimensional ar- 
rangement, in which logical qubits are encoded 
in a ribbon of physical qubits with constant 
width in the vertical direction and length in 
the horizontal direction proportional to the 
total number of logical qubits. Each logical 
qubit lives in a "bi-block" — a pair of Bacon- 
Shor code blocks, one storing the data pro- 
cessed by the computation, and the other used 
as an auxiliary block for teleporting gates. If 
desired, the data block can be shuffled back 
and forth between the left and right sides of 
the bi-block via a "one-bit-teleportation" cir- 
cuit consisting of a |+) L preparation, a ZZ L 
measurement, and an X L measurement, as in 

FgT5| 

Each nxm logical block is interlaced with an 
nx (2m— 1) array of ancilla qubits, as in Fig. 7 
Of the 2m — 1 ancilla qubits in a row, m — 1 are 
used to read out the outcome of a ZZ measure- 
ment performed on the qubit's two horizontal 
neighbors, preparing an m-qubit cat state in 
the row. In addition, there is a column of n 
ancilla qubits at the boundary between two 
adjacent blocks, allowing the cat states to be 
extended to length 2m or 3m as needed for the 
measurements M^z an ^ ^zzz m ^ ne CNOT 
gadget. In the \+) L preparation gadget, n—1 
ancilla qubits in a column arc used to read 
out the outcome of a ZZ measurement on the 
qubit's two vertical neighbors, preparing an n- 
qubit cat state in a column. 

A CNOT gate is performed on a neighboring 
pair of logical bi-blocks using one of the vari- 



ants of the CNOT gadget depicted in Fig. 4 



Let us denote the bi-block on the left as bi- 
block 1 and the bi-block on the right as bi- 
block 2. If the data in bi-block 1 is on the 
right side of the bi-block and the data in bi- 
block 2 is on the left side, then we may use the 
CNOT gadget design in |Fig. 4ip , to perform a 
CNOT gate in either direction (i.e. with ei- 
ther logical qubit as the control and the other 
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FIG. 7: Arrangement of Bacon-Shor qubits for ge- 
ometrically local computation for 3x5 code block. 
Large blue circles indicate data qubits, small green 
circles ancilla qubits. Solid lines indicate locations 
between which CZ gates may be performed. Dot- 
ted lines separate qubits into code blocks. The 
ribbon may be extended arbitrarily far to the right 
and left with additional code blocks. 



as the target) using cat states of length 2m 
and 3m, with the output data from the gate 
appearing on the left side of bi-block 1 and the 
right side of bi-block 2. Otherwise, we may 
use one of the alternative gadget designs dis- 



cussed in Fig. 4 to deal with the cases where 
the data in bi-block 1 is on the left side and/or 
the data in bi-block 2 is on the right side. 
Whichever design we choose, the CNOT gad- 
get flips the data from one side to the other in 
both bi-blocks, and only the Mzzz measure- 
ment reaches across the boundary between the 
two bi-blocks. 

Note that with this method for performing 
a logical CNOT gate using geometrically local 
gates there is no need to swap the positions 
of pairs of physical qubits. The CZ gate is 
the only two-qubit gate used at the physical 
level, so that the assumption of highly biased 
noise remains physically plausible. Bacon- 
Shor blocks can be swapped using circuits of 
logical CNOT gates. 

To complete a universal gate set we also 
need to be able to inject noisy non-Clifford 
states into code blocks which can then be pu- 
rified by state distillation. This state injection 
can be performed using one-bit teleportation 
— the state is prepared using a single phys- 
ical qubit, while a logical block is prepared in 
the state \+} L , then the weight- (m + 1) oper- 
ator Z L Z is measured on the single qubit and 
a single row of the block, and finally the single 
qubit is measured in the X basis. This pro- 
cedure prepares the block in the state \tp) L , 
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up to a logical Pauli operator known from the 
measurement outcomes. To perform this task 
in a bi-block, we prepare | +) L on (say) the left 
side of the bi-block, and build a length-(m + 1) 
cat state using m of the ancilla qubits that ac- 
company a row of the block on the left side, 
and one additional ancilla qubit on the right 
side. The single-qubit state \ip) is prepared 
using one physical qubit on the right side of 
the bi-block which is adjacent to the (m + l)st 
qubit of the cat state, allowing the measure- 
ment of Z L Z to be complete using local CZ 
gates. 

If we wish to build a two-dimensional ar- 
chitecture, we may stack horizontal ribbons of 
logical qubits on top of one another. To per- 
form CNOT gates on pairs of logical bi-blocks 
that are stacked vertically, we need to be able 
to perform M.^ zz using cat states shared by 
vertically stacked blocks. For this purpose we 
can use ancilla qubits in neighboring bi-blocks 
to establish a "channel" linking the cat states 
neighboring corresponding rows of the verti- 
cally stacked blocks. 

We note that in the case of unbiased noise, 
a two-dimensional architecture might be real- 
ized more simply, using symmetric Bacon-Shor 
codes. In that case we can build a CNOT 
gate using either the gadget in |Fig. 4| or its 
Hadamard dual, with X and Z interchanged. 
The primal gadget uses horizontal cat states to 
execute CNOT gates on horizontally neighbor- 
ing blocks, while the dual gadget uses vertical 
(dual) cat states to execute CNOT gates on 
vertically neighboring blocks. 

For the geometrically local case, our analysis 
needs to be modified in several ways. As al- 
ready indicated, we will not be able to choose 
the length p of the cat state to optimize the 
gadget. Rather, the locality constraint re- 
quires that the length of the cat state match 
the weight of the measured operator (p = 2m 
for A4 Z z an d V = 3m for Mzzz)- Another 
change is that we must now consider the conse- 



quences of storage errors acting on idle qubits. 
Previously we assumed that the off-line prepa- 
ration of cat states can be scheduled so that 
the cat states are always available immediately 
when needed. But now there is only one set 
of m ancilla qubits to accompany each row of 
a data block. Each time a cat state is mea- 
sured we must prepare a new one, and the 
preparation involves a syndrome measurement 
repeated r' times. In each round of syndrome 
measurement, all qubits except the ones at the 
ends of the cat state participate in two ZZ 
measurements, and because the same ancilla 
qubits are used for two different ZZ measure- 
ments, four time steps are required for each 
one (a preparation, two CZ gates, and an X 
measurement). Therefore, the data waits for 
8r' time steps before the cat state is ready to 
interact with the data. While the data qubits 
wait for the cat state to be ready, they may be 
subject to storage errors. We must estimate 
how these storage errors contribute to the fail- 
ure probability of the logical measurements in 
the CNOT gadget. 

We will denote by e s the probability per 
time step of a diagonal storage error and by e' s 
the probability per time step of a non-diagonal 
storage error. How many storage steps are in- 
cluded depends on the gadget design, which 
in turn depends on whether the data is on the 
left or right side of the incoming bi-blocks. In 
the worst case, the measurement M.\ is per- 
formed on a block that undergoes four logical 
Z-type measurements between its preparation 
and measurement, two in the current gadget 
plus another two in the immediately preced- 
ing gadget, and for each of these measurements 
the data waits for 8r' steps as the cat state is 
prepared. If each Z-type logical measurement 
is repeated r times, then the total contribution 
to the Z error probability per qubit due to the 
identity gates in the circuit is 32rr'(e s + e' a ). 
Therefore, Eq. ^ is modified to become 



J 



PL(M L X ) < (j^) [n(2r+ + 3r + 2)(e + e') + 32nrr'(e s + e' s ) + 8nrrV] (m+1)/2 . (19) 
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Furthermore, a non-diagonal storage fault 
on any of the 2m data qubits can flip the out- 
come of M.\ z - This fault can occur in any 
of the 8r' time steps while a cat state is pre- 
pared in either the M zz or one of the preced- 
ing measurement. Similarly, a non-diagonal 
storage fault on any of the 3m data qubits can 



flip the outcome of M zzz , where his fault can 
occur in any of the 8r' time steps while the cat 
state is prepared in either the M zzz or one of 
the preceding measurements. Thus Eq. ( 9|10 ) 
are replaced by 



KAM L ZZ ) <(JM 2m(2r+ + 3r)e' + 2m(24rr'0 + f^A [(2m + 2p + 2pr')(e + £ ')] (r+1)/2 
2 L 2 

P C \AM L ZZZ ) <( jy 3m(2r+ + 4r)e' + 3m(32rrV.) + (^) [(3m + 2p + 2pr')(e + £ ')] (r+1)/2 



-i (n+l)/2 



-1 (n + l)/2 



(20) 



Note that the number of storage locations 
in the circuit might be reduced by adding ad- 
ditional ancilla sites, so that more measure- 
ments can be performed in parallel, and/or 
by combining together measurement locations 
with immediately following preparation loca- 
tions. 

Geometric locality also requires that we re- 
move the redundant cat state measurements 



that "wrap around" the code block; Thus 
there is no syndrome parity check; any syn- 
drome outcome is accepted, and a single di- 
agonal fault can generate a syndrome error. 
There are also storage locations where the 
cat state qubits wait while ancilla qubits are 
prepared and measured. Because of these 
changes, Eq. ( 12|13 ) are replaced by 



P er r(P|cat>) <™"££ ("J) (Jl t ) ( U * *) + + ^ + e' s )] t+u (2pe> + Aps' s ) s 

(21) 

and 

Perr(P\ +) ) < ™£ £ (^) ) (" + ') i^s + 2ns' + An(e s + s' s )] t+u (2ne'+Ans' s y. (22) 



VII. RESULTS FOR LOGICAL CSS 
GATES 

A. Unrestricted gates 

With our analytic upper bound on the effec- 
tive error strength f the CNOT gadget, 
we can find the optimum choice of the code 
block size n x m as well as the four CNOT pa- 



rameters r, r', r + and p, for any choice of the 
error strengths e and e' . We did this by brute 
force search over the parameter space. |Fig. 8| 
plots the results of this optimization for five 
choices of the bias R = e/e' . 

These results were obtained by optimizing 
the size of a Bacon-Shor code block which is 
used by itself, rather than as part of a more 
complex concatenated code. Alternatively, we 
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FIG. 8: Optimal CNOT logical error rate versus physical error rate for various values of the bias. Labels 
represent the n x m dimensions of the Bacon-Shor code block. For blocks with a single row, one can use 
the code studied in [8] which is a special case of the asymmetric Bacon-Shor code. 



may consider using our Bacon-Shor gadgets 
at the bottom level of a concatenated coding 
scheme. In particular, we may estimate the ac- 
curacy threshold for biased noise achieved by 
such concatenated codes, as was done in Ref. 
[5] for the special case of the n = 1 Bacon-Shor 
code (i.e., the repetition code). We have ex- 
tended the analysis of the accuracy threshold 
performed in [5] to more general Bacon-Shor 
codes. However, for bias above 10 3 we found 
no improvement over the threshold estimate 
in [5], because the n = 1 code turns out to 
provide the best threshold value. For values 
of the bias between 1 and 10 3 , the optimal 
value of n turns out to be greater than 1, and 
for that range of the bias we found modest im- 
provements in the accuracy threshold estimate 
compared to [8]. 

Using asymmetric Bacon-Shor codes, we can 
exploit the noise bias to improve the number 
of physical gates needed to construct a fault- 



tolerant CNOT gate with a specified logical 
shows this overhead factor 



Fig. 9 



error rate. 

and also indicates the dimensions of the op- 
timal code block. This plot includes a com- 
parison with the performance of concatenated 
codes surveyed in [7] for the case of unbiased 
noise. For highly biased noise, asymmetric 
Bacon-Shor codes achieve a much lower log- 
ical error rate with a smaller number of phys- 
ical two-qubit gates, compared to the perfor- 
mance of these previous constructions for un- 
biased noise. 



B. Geometrically local gates 

Optimizing the parameters for the case of 
geometrically local gates, assuming the storage 
error rate is negligible, we obtain the results 
displayed in |Fig. 10| and |Fig. 11| Enforcing 
locality significantly weakens the performance 
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FIG. 9: Optimal CNOT logical error rate versus required number of physical gates for nxm asymmetric 
Bacon-Shor codes, for a physical error rate e = 10 -4 at various values of the bias, plotted alongside 
results for codes studied in [7] with no bias. 



of our constructions — at a bias of 10 4 and 
dephasing error strength of e = 10~ 4 , the op- 
timal effective error strength we can achieve 
increases from 10~ 20 to 1CP 9 , while requir- 
ing roughly 8 times as many gates. There- 
fore, in the geometrically local case our esti- 
mated logical error rate for asymmetric Bacon- 
Shor codes and highly biased noise is roughly 
similar to what can be achieved using surface 
codes with a similar number of logical gates, 
while disregarding the bias [HI [12] . The most 
important reason that nonlocal gates prove 
to be advantageous is that if geometry is ig- 
nored then the cat state used in (for exam- 
ple) the measurement M. zzz can be chosen to 
be much shorter than the length-3m cat state 
used in our geometrically local construction. 
The number of physical qubits per block re- 
quired to achieve the optimal logical error rate 
stays relatively small, as indicated in |Fig. 12[ 

Our upper bound on the logical error rate 



is likely to be far from optimal, and we also 
expect that the performance could be sub- 
stantially improved by using a more sophis- 
ticated (but harder to analyze) method like 
perfect matching for decoding the cat state 
syndrome history. Suitable lattice deforma- 
tions may also make it more feasible to reduce 
the length of cat states substantially, yield- 
ing further enhancements in performance for 
the case of geometrically local gates. Such 
improvements might make asymmetric Bacon- 
Shor codes more competitive relative to sur- 
face codes. 



Geometrically local gates and 
measurement bias 



In some experimental settings measure- 
ments are noisier than gates, in which case 
we say the noise has "measurement bias." We 
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FIG. 10: Optimal CNOT logical error rate versus 
physical error rate for various values of the bias, 
where two-qubit physical gates are assumed to be 
geometrically local. 
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FIG. 11: Optimal CNOT logical error rate versus 
required number of physical gates for geometri- 
cally local asymmetric Bacon-Shor codes at var- 
ious values of the bias, for a physical error rate 

e = i(r 4 



may consider a noise model in which the de- 
phasing error rate e in diagonal gates exceeds 
the rate e' for nondiagonal faults, while in ad- 
dition the error rate e^f for single-qubit mea- 
surements exceeds e. (We also continue to as- 
sume that e is the error rate for single-qubit 

our geo- 



preparations.) As shown in Fig. 13 
metrically local asymmetric Bacon-Shor code 
gadgets are somewhat robust against increas- 
ing measurement bias, because our gadgets 
contain considerably more gates than mea- 
surements (or preparations). Relative to the 
case Em — the performance of the logi- 



FIG. 12: Bacon-Shor block size vs optimal (in 
number of gates) CNOT logical error rate for geo- 
metrically local asymmetric Bacon-Shor codes at 
various values of the bias, for a physical error rate 
e = 10 -4 . 
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FIG. 13: Optimal CNOT logical error rate versus 
required number of gates for geometrically local 
asymmetric Bacon-Shor codes, assuming a physi- 
cal error rate e = 10~ 4 , bias 10 4 , and various rates 
of the measurement error rate 6m- 



cal CNOT gate is not much affected as the 
measurement bias £m/£ rises to about 5. In 
contrast, surface-code gadgets, which contain 
a higher number of measurements relative to 
the number of gates, are more sensitive to mea- 
surement bias. 
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VIII. STATE INJECTION 

Having analyzed the performance of our 
fault-tolerant CSS gates, we now turn to 
the state injection and distillation protocols 
needed to complete a universal set of fault- 
tolerant gates. An arbitrary single-qubit state 
\ip) can be injected into the Bacon-Shor block 
using the "one-bit teleportation" circuit de- 
picted in Fig. 5 Here a logical qubit is pre- 
pared in the state |+) L , the two-qubit mea- 
surement M. zz is performed jointly on an un- 
protected qubit and the Bacon-Shor block us- 
ing a length- (m + 1) cat state, and finally the 
A-basis measurement Mx is performed on the 
unprotected qubit. To estimate the error in 
the state injection step, we should consider all 
the ways in which the outcomes of the mea- 
surements M zz and Mx might deviate from 
their ideal values. 

In the state distillation circuit, this injec- 
tion step is directly followed by a CNOT gate. 
Hence some sources of error in the teleporta- 
tion circuit need not be attributed to the state 
injection step, as they are already included in 
our error estimate for the CNOT gate that fol- 
lows. In particular, the possibility of failure in 
the preparation of \+) L or a cat state is in- 
cluded in Eq.Q, and conservatively at that, 
since we assumed there that the cat states had 
length 2m or 3m rather than m + 1. Here we 
assume that the number of repetitions r' of 
the cat state syndrome measurement, and the 
number of syndrome measurement repetitions 



r + in the |+) L state preparation, match the 
number of repetitions in the following CNOT 
gate. If so, we may bound the injection error 
probability P m {J) by 



(23) 

where e^, bounds the probability of an error in 
the preparation of the unprotected state 
(we presume that this error is not necessarily 
diagonal in the Z basis). 

We assume that M zz is repeated r times 
to improve its reliability. In the first mea- 
surement we can time the preparation of the 
unprotected qubit and the cat state so that 
there are no storage errors prior to step in 
which the cat state interacts with the data. 
But in subsequent measurements storage er- 
rors on the data may accumulate during the 
8r' time steps while the cat state is prepared 
for the next round. To estimate P* rr (Mx), we 
note that each of the m + 1 cat-state qubits is 
acted upon by two CZ gates during each of 
r' rounds of cat-state syndrome measurement, 
where a nondiagonal fault in any of these gates 
could result in a Pauli-frame A error. Further- 
more, a storage fault acting on the unprotected 
qubit, a fault in final A measurement of the 
unprotected qubit, or a fault in the CZ gate 
that couples the unprotected qubit to the cat 
state could cause Mx to fail. Therefore we 
find 



P*„(Mx) < (r + l)(e + e') + 8(r - l)r'(e s + e' s ) + 2(m + l)rrV 



(24) 



A nondiagonal error acting on any data qubit 
can flip the outcome of M zz . Otherwise, a 
diagonal fault in each of at least (r + 1)/2 mea- 
surement rounds could cause the M zz to fail. 
Since in each round there are m + 1 gates cou- 



pling the qubits to the data, (m-(-l)r' CZ gates 
in the cat state preparation, as well as m + 1 
single-qubit preparations and measurements, 
we find 
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PL{M L ZZ ) < re' + 8r'(r - 1)4 + m(2r + + r)e' + [(m + l)(r' + 3)(e + e')] (r+1)/2 - (25) 
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FIG. 14: Injection error probability for the op- 
timal geometrically local Bacon-Shor codes found 
in |Fig. 10| with the probability of an error on the 
unprotected state \ip), ey,, set to be equal to the 
dephasing error probability e. 



|Fig. 14| bounds the injection error probabil- 
ity assuming the values of m, r and r' cho- 
sen to optimize the geometrically local CNOT 
gadget. For large biases the injection er- 
ror probability is around 1%, well below the 
thresholds for successful distillation, as will be 
seen in Sec. IIXI 

By choosing r = 1 (no repetition of the ZZ L 
measurement), we can avoid storage locations, 
and the leading contribution to P e ir{S) linear 
in e (assuming e' <C e) is 



Perr(J) = 3e + (m + 1)(V + 3)e + 



(26) 



Increasing to r = 3 entails increasing the sensi- 
tivity to storage errors, while providing better 
protection against diagonal errors in CZ gates: 



P e „(J) = 5e + 16r'e s 



(27) 



which might be a significant improvement if 
e s < e. 

The probability of error in state injection 
depends on m and also on r', the number of 
times the cat-state syndrome measurement is 
repeated, which we assume matches the value 



of r' in the CNOT L gate that follows the state 
injection step. We note, though, that the value 
of r' might increase gradually as state injec- 
tion proceeds. If there are multiple rounds of 
state distillation, we might be willing to ac- 
cept a larger CNOT L error rate P orr (CNOT L ) 
in early rounds where the error in the dis- 
tilled state is higher, with P orr (CNOT L ) de- 
clining in later rounds as the state's purity im- 
proves. Adjustment of the Bacon-Shor block 
size, and hence of P err (CNOT L ), is easy to in- 
corporate in our circuit constructions, as there 
is no need for the control and target blocks in 
the CNOT L circuit to be of equal size. Using a 
smaller code in early rounds may save on over- 
head, but more importantly reducing the value 
of m and r' used in the first round improves 
the error P elI {J) of the initially injected state. 
However, for simplicity, we did not invoke this 
strategy of gradually increasing the block size 
in the analysis of state distillation reported in 

sec. inn 



IX. STATE DISTILLATION 

So far we have seen how, using asymmetric 
Bacon-Shor codes, to perform fault-tolerant 
encoded versions of the operations in the "CSS 
set" <?css : tne CNOT gate, preparations of 
the encoded states |0) L and |+) L , and the 
measurements A4^, Ai^. To perform fault- 
tolerant universal quantum computation, we 
will need in addition to prepare high-fidelity 
encoded versions of the states 



|T) 



1 

71 



(|0> + i|l», 



(28) 



Using a | +i) ancilla state and CSS operations, 
we can teleport the Clifford group gates Q = 
exp (i~ X) and S = exp (— i\Z\ which suffice 
for generating the full Clifford group. Using a 
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\T) ancilla, the S gate, and CSS operations, 
we can teleport T — exp (— if^), completing 
a universal gate set. 

To prepare these encoded ancilla states, we 
first prepare noisy versions of the encoded 
states, and then use a distillation protocol to 
generate the needed high-fidelity versions of 
these states [16]. To distill the ancillas we 
need only CSS operations, and can use a cir- 
cuit based on the [[7,1,3]] Steane code; it takes 
7 noisy (|— i)) ancillas and produces one 
clean |— i) ancilla. The |T) distillation 

protocol uses both CSS operations and 
ancillas and is based on Reed-Muller codes 
[TB] : it produces one clean |T) ancilla from 15 
noisy ones. Other, more efficient, distillation 
protocols have been proposed recently [TM2"U] . 

10° | . , , , 




3 



FIG. 16: Output error probability after n lev- 
els of \T) distillation, with the values of ei n and 
£ C ss taken from the optimal values of |Fig. 14| for 
e = 10" 4 and various biases. Only two rounds of 
distillation are required in each case to reach the 
floor. 
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FIG. 15: Output error probability after n levels of 
\T) distillation, starting from e- ln = 0.15, for var- 
ious values of ecss- The \+i) ancillas are assumed 
to be maximally distilled, so that £|+j) = 4e css . 
Eventually the output probability levels off to 
around 8e C ss- 

In the original analysis 16] of the perfor- 
mance of these state distillation protocols, it 
was assumed that the CSS operations were 
perfect. Here we want to consider the case 
in which the CSS operations, protected by an 
optimally chosen Bacon-Shor code, are them- 
selves noisy. We provide more details about 
our analysis in [21]; state distillation using 
noisy CSS operations has also been discussed 
previously in, for example, Refs. [TTl H2l |2"2"] . 

When the CSS operations are perfect, we 
can attain ancilla states with arbitrarily low 
error by performing a sufficient number of 



rounds of distillation, assuming that the fi- 
delity of the initial ancilla with the desired 
ideal state exceeds a threshold value. How- 
ever, when the CSS operations are noisy, the 
error rate e css for the CSS operations sets a 
nonzero floor on the error in the output an- 
cilla states — once this floor is reached, fur- 
ther rounds of distillation produce no further 
improvement, because the error in the output 
is dominated by the errors in the distillation 
circuit rather than the errors in the input an- 
cilla states. We have designed fault-tolerant 
distillation circuits that minimize this noise 
floor, and have found the corresponding lower 
bounds on the output error for distillation of 
both \+i) and |T) states [ST]- For distil- 
lation, the best achievable error rate for our 
procedure is 

e l+i) >4s css + 0(e 2 csa ), (29) 

and for |T) distillation it is 

£|T) > 8e C ss + 0(£css)- (30) 

A single round of the noisy distillation pro- 
tocol improves the ancilla error probability 
substantially, provided that the CSS error rate 
£ css in the distillation circuit is small compared 
to the input error probability e; n of the noisy 
ancilla states, and also that e- ln is safely below 
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the (quite high) threshold error rate for the 
ideal distillation protocol. The convergence of 
the ancilla error rate to the floor « 8e css is il- 



lustrated for \T) distillation in Fig. 15 for the 
case where the error rate of the input ancillas 
is £i„ = 0.15. 

For initial ancillas with errors that are very 
close to the threshold of the ideal distillation 
protocol, it can in principle take many rounds 
of distillation to reach the floor set by the CSS 
error rate. On the other hand, if the initial an- 
cillas are not too noisy, then it will take just 
a few rounds of distillation to reach this floor. 
In |Fig. 16| the error rate of the output an- 
cilla is plotted as a function of the number 
of rounds of distillation, where the input er- 
ror rate is the injection error probability esti- 
mated in |Fig. 14| For the parameters shown, 
only two rounds of distillation suffice to reach 
the floor set by the logical CSS error rate. This 
relatively modest overhead cost for executing 
non-Clifford logical quantum gates is one of 
the advantages of our fault-tolerant scheme. 
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